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The  protein  PEBl  (28  kOa)  is  a  common  antigen  and 
a  major  cell  adherence  molecule  of  Campylobacter  je¬ 
juni  and  Campylobacter  eoli.  We  created  a  bank  of 
chromosomal  DNA  fragments  of  C.  jejuni  strain  81- 
176  using  Xgtll.  Screening  this  bank  in  Escherichia 
coli  Y1090  cells  with  antibody  raised  against  purified 
PEBl  enabled  us  to  isolate  and  to  purify  a  clone  with 
a  2.6-kilobase  insert  expressing  an  immunoreactive 
protein  of  28  kDa.  DNA  sequencing  revealed  that  the 
insert  contains  three  complete  and  two  partial  open 
reading  frames  (ORFs),  designated  6'  to  3'  as  ORFs  A— 
E.  The  peblA  gene  (ORF  D)  contains  780  bases  encod¬ 
ing  a  259-residue  polypeptide  having  a  calculated  mo¬ 
lecular  mass  of  28,181  Da.  The  peptide  sequence  start¬ 
ing  at  residue  27  matches  that  determined  from  amino- 
terminal  sequencing  of  mature  PEBl  from  C.  jejuni. 
The  first  26  residues  contain  typical  signal  peptidase  I 
and  II  cleavage  sites.  The  deduced  amino  acid  compo¬ 
sition  and  pi  of  the  recombinant  mature  protein  are 
similar  to  those  determined  for  purified  PEBl.  Gene 
bank  searches  indicated  significant  overall  homology 
of  pehlA  and  ORF  C  with  operons  for  amino  acid 
transport  systems  in  other  Gram-negative  organisms. 
peblA  is  homologous  to  the  binding  components  of 
systems  such  as  g'/nH  (27.87o)  and  hisj  (28.9%), 
whereas  ORF  C  has  nearly  607o  identity  to  grfnQ  and 
hisP.  Thus,  PEBl  could  be  involved  both  in  binding  to 
intestinal  cells  and  in  amino  acid  transport. 


Campylobacter  jejuni  and  the  closely  related  species  Cam¬ 
pylobacter  coli  are  important  causes  of  diarrheal  disease  in 
humans  worldwide  (1-3).  These  organisms  exhibit  consider¬ 
able  serotjpic  diversity  with  >50  0  antigens  (4)  and  a  similar 
number  of  heat-labile  antigens  (5).  However,  evidence  from 
natural  infections  in  developing  countries  (6,  7),  from  raw 
milk  drinkers  in  the  United  States  (8),  and  from  experimental 
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infections  in  volunteers  (9)  and  monkeys  (10)  suggests  that 
immunity  to  C.  jejuni  is  induced  by  recurrent  exposure;  thus, 
it  may  be  possible  to  develop  a  vaccine  against  C.  jejuni 
enteritis.  We  have  previously  identified  two  antigenic  pro¬ 
teins,  PEBl  (28  kDa)  and  PEB3  (30  kDa),  from  C.  jejuni  that 
are  commonly  recognized  by  convalescent  sera  from  patients 
with  sporadic  C.  jejuni  diarrhea  (11)  and  that  may  be  vaccine 
candidates. 

PEBl  is  conserved  in  all  C.  jejuni  and  C.  coli  isolates  and 
is  located  on  the  surface  of  C.  jejuni  cells  as  identified  by 
immunogold  electron  microscopy  (12),  indicating  that  it  is  a 
good  target  for  the  immune  system  (11).  PEBl  (CBFl)  plays 
^  major  role  in  adherence  to  HeLa  cells,  suggesting  that  it 
may  be  involved  in  Campylobacter  colonization  of  the  intestine 
(13).  PEBl  is  a  lysine-rich  basic  (pi  8.5)  protein  without 
methionine  at  its  amino  terminus,  suggesting  that  a  leader 
peptide  is  cleaved  during  PEBl  maturation  (11);  the  amino 
terminus  of  PEB 1  has  no  significant  homology  to  other  known 
proteins  (11).  We  thus  undertook  the  molecular  cloning  and 
sequencing  of  the  gene  encoding  PEBl  to  determine  its  pri¬ 
mary  sequence,  to  understand  its  post-translational  modifi¬ 
cation  and  intracellular  transport,  and  because  large-scale 
production  of  recombinant  PEBl  in  Escherichia  coli  will 
facilitate  functional  and  immunological  studies. 

We  report  the  cloning  and  sequencing  of  the  PEBl  struc¬ 
tural  gene  (which  we  name  peblA)  from  C.  jejuni  strain  81- 
176.  The  deduced  amino  acid  sequence  indicated  that  PEBl 
has  a  cleaved  26-amino  acid  leader  peptide;  the  complete 
molecule  exhibits  significant  hc^ology  to  Enterobacteriaceae 
glutamine-binding  protein  (glnii)  (14),  lysine/arginine/orni- 
thine-binding  protein  (LAOl^lS),  and  histidine-binding  pro¬ 
tein  (hisJ)  (16)  and  may  be  part  of  a  homologous  operon. 

E.XHERI.MENTAL  PROCEDURES 

''Enzymes  and  C/icmicafj— Restriction  endonucleases,  calf  intes¬ 
tinal  alkaline  phosphatase,  IPTC,  and  affiiuty-purifled  goat  antibody 
to  rabbit  IgC  conjugated  with  alkaline  phosphatase  were  obtained 
from  Boehringer  Mannheim.  Other  important  reagents  were  bacte¬ 
riophage  T4  DNA  ligase  (New  England  BioLabs,  Inc.),  nitrocellulose 
BA-85  (Schleicher  &  Schuell),  5-bromo-4-chloro-3-indoIyI-3-o-gal' 
actoside  (Sigma),  and  Sepharose  CL-2B  (Pharmacia  LKB  Biotech¬ 
nology  Inc.). 

Bacterial  Strains— C.  Jejuni  strain  81-176  (ATCC  55026),  used  tor 
PEBl  production  and  genomic  DNA  preparation,  was  isolated  from 
an  outbreak  of  Campylobacter  diarrhea  and  has  been  demonstrated 
to  be  a  virulent  strain  in  monkeys  (10)  and  in  volunteers  (9).  E.  coli 
strains  Y1088,  Y1089,  and  Y1090  have  been'described  (17),  and  XLl- 


'  The  abbreviations  used  are:  LAO,  lysine/arginine/ornithine-bind- 
ing  protein;  IPTG,  isopropyI-l-thio-3-D-galactopyranoside;  kb,  kilo- 
base(s);  bp,  base  pair(s);  PCR,  polymerase  chain  reaction;  ORF,  open 
reading  frame. 
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Blue  (StraUgene,  La  Jolla,  CA)  was  used  for  transformations. 

Prtparation  of  X  Bank  uiitk  Insert  DNA  from  C.  jejuru — A  genomic 
library  was  prepared  from  C.  jejuni  strain  81-176  as  previously  de¬ 
scribed  (18).  In  brief,  purified  DNA  (800  vg)  was  sheared  by  aonica- 
liiin.  DNA  fragments  of  1.0-10  kb  were  isolated  and  ligated  to  £coRi 
linkers  mid  then  to  dephosphorylated  Xgtll  arms.  The  ligation  mix¬ 
ture  wus  added  to  a  X-packaging  mixture  (Gigapack,  Stratagene)  and 
titered  on  Y1088  cells. 

Immunological  AfetAods— Polyclonal  antiserum  to  PEBl  purified 
from  strain  81-176  was  raised  in  a  hyperimmunized  rabbit  as  previ- 
(lu.sly  described  (11).  This  serum  recognizes  heterologous  PEBl  an¬ 
tigens  in  all  C.  jejuni  and  C.  eoli  strains,  but  lias  essentially  no 
reactivity  against  other  C.  jejuni  or  C.  eoli  antigens.  Immunological 
screening  of  the  C.  jejuni  library  was  performed  as  previously  de¬ 
scribed  by  Gotschlich  et  oL  (19).  SOS-polyacrylamide  gel  electropho¬ 
resis  was  performed  on  lysates,  proteins  were  transferred  to  nitrocel¬ 
lulose  membranes,  and  the  blots  were  developed  by  an  immunoenzy- 
matic  method  (11). 

Subcloning  and  Physical  Mapping  of  Insert— Tor  expression  and 
mopping  of  the  insert,  the  original  clones  in  Xgtll  were  ^gested  with 
EcoRI,  and  the  inserts  were  separated  in  low-melting-point  agarose 
and  ligated  into  the  EcoRl  site  of  phosphatase-treated  pUCl9.  The 
ligation  mixture  was  used  to  transfbrm  competent  XLl-Blue  E.  eoli 
celts,  and  carbenicillin-resistant  transformants  were  isolated  (20). 
Recombinant  plasmids  were  purified  (21)  and  digested  with  restric¬ 
tion  endonucleases  (22).  Western  blotting  was  performed  with  anti- 
.scrum  to  PEBl  to  identify  expressed  proteins. 

DNA  Analysis — The  C.  jejuni  DNA  insert  in  pUCl9  (nPBllO)  was 
digested  with  exonuclease  111  to  generate  a  series  of  nested  deletion 
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Ftc.  1.  Immunoblot  of  lysates  of  lysogenized  E.  eoli  Y1089 
cells  producing  recombinant  C.  Jejuni  antigens.  Lane  a,  cells  of 
C.  jejuni  strain  81-176;  lane  b,  cells  of  £.  eoli  strain  Y1089  conUining 
Xgtll  without  an  insert;  lane  e,  cells  of  lysogenic  clone  1;  lane  d,  cells 
of  lysogenic  clone  2.  £.  eoli  cells  in  laries  b-d  were  cultured  overnight 
in  the  presence  of  2  mM  IPTG  to  induce  expression  of  genes  down¬ 
stream  of  the  laeZ  promoter  in  Xgtll.  A  band  migrating  at  —28  kOa 
(indicated  by  arrow)  was  recognized  by  rabbit  antiserum  to  purified 
PEBl  (1:10,000)  (1 1)  in  clones  1  and  2,  but  not  in  the  strain  harboring 
Xgtll  alone. 

PEBl  produced  by  C.  Jejuni  strain  81-176.  The  DNA  from 


mutants  (23)  and  also  by  bidirectional  endonuclease  deletion  based 
on  the  restriction  map.  The  nucleotide  sequence  of  plasmid  DNA  was 
determined  on  both  strands  by  the  dideoxynucleotide  chain  termi¬ 
nation  reaction  (24)  and  was  analyzed  with  DN'ASTAR  software  to 
define  open  reading  frames  and  restriction  sites.  The  amino  acid 
•sequence  of  the  deduced  gene  product  was  analyzed  for  hydrophobic- 
ity  with  the  algorithm  of  Kyte  and  Doolittle  (25)  and  for  secondary 
structure  with  the  algorithm  of  Gamier  et  al.  (26).  Nucleic  acid  and 
amino  acid  homologies  to  GenBank  and  EMBL  data  bases  were 
examined  with  Nuscan  and  ProScan  programs  using  the  method  of 
Pearson  and  Lipman  (27). 

Amplification  of  peb  I A  Gene  from  C.  jejuni  Isolates  Using  Polym¬ 
erase  Chain  Reaction  (PCR) — Oligonucleotides  were  synthesized  on  a 
MiHigen  7500  automated  DNA  synthesizer.  PCR  was  performed  with 
bacterial  chromosomal  DNA  concentrations  of  1  ng/ul  and  with 
primer  annealing  at  54  *C  for  1  min.  extension  at  72  'C  for  2  min, 
and  denaturation  at  94  'C  for  1  min  for  30  cycles.  PCR  products  were 
electrophoresed  on  1%  agarose  gel  and  purified  (GeneCIean,  BiolOl, 
Inc.,  La  Jolla,  CA)  for  restriction  digestions. 

Southern  Blot  Ano/ysis— Restriction  fragments  generated  by 
Hindlll  digestion  of  whole  chromosomal  DNA  on  0.7%  agarose  gel 
were  transferred  to  nylon  membranes  after  denaturation  and  neu- 
tralization  according  to  the  method  of  Southern  as  described  (22)ir 
Probe  was  labeled  with  (”PJdATP  by  random  priming,  and  hybrid!-  '' 
Zntion  wn.s  performed  in  50%  formamide  buffer  overnight  at  42  ’C 
(22). 


each  clone  was  purified,  and  the  inserts  were  excised  following 
EcoRI  digestion  and  resolved  by  agarose  gel  electrophoresis. 
Each  clone  contained  a  single  insert  of  2.6  kb. 

Characterization  of  C.  jejuni  Insert  DNA— The  2.6-kb  insert 
in  clone  1  was  subcloned  into  pUCl9,  and  transformanU  were 
screened  by  immunoblot  using  antiserum  to  PEBl.  The  plas¬ 
mids  from  two  transformants  producing  immunoreactive  mol¬ 
ecules  migrating  at  -28  kDa  had  the^ert  im opposite  ori¬ 
entations  and  were  designated  pPB)1^ig.  2)  and  pPB219. 
Both  strains  expressed  the  molecule  ih'the  absence  of  IPTG, 
but  IPTG  enhanced  expression  in  pPBll9,  indicating  an 
effect  mediated  by  the  0-galactosidase  promoter  of  the  vector. 
Hindlll  and  Ncol  deletion  mutants  of  pPBll9  (pPB203  and 
pPBll)  both  expressed  the  full-length  28-kDa  protein  (Fig. 

Nucleotide  Sequence  of  peblA— The  nucleotide  sequence  of 
the  2687-kb  insert  determined  according  to  the  strategy  shown 
in  Fig.  2  yielded  three  complete  and  two  partial  open  reading 
■,fr5mes'(ORFs),  which  were  designed  5'  to  3'  as  ORFs  A-E 
(Fig.  3).^ORF  A  is  a  partial  ORF  encoding  21  amino  acids, 
ending  with  TAA  at  positions  65-67.  Between  ORFs  A  and 
B.  there  are  15  nucleotides  containing  a  putative  (28)  ribo- 
somal  binding  site  (AGGA,  positions  72-75)  7  nucleotides 


RESULTS 

Detection  of  Recombinant  Bacteriophage  Expressing  C.  je¬ 
juni  PEBl  Protein— The  Xgtll  bank  of  genomic  DNA  from 
C.  jejuni  strain  81-176  in  Y1088  cells  yielded  8.2  X  10*  plaque¬ 
forming  units  with  a  76.2%  insertion  rate.  After  amplification 
in  Y1090  cells,  the  bank  was  screened  with  an  E.  coii-absorbed 
rabbit  antiserum  to  purified  PEBl  from  strain  81-176  (11)  to 
detect  expression  of  recombinant  clones  bearing  PEBl  anti¬ 
gens.  Two  positive  plaques  were  detected  from  a  10-cm  di¬ 
ameter  Petri  dish  containing  —10*  plaques.  These  two  clones 
were  plaque-purified  and  amplified  on  Y1090  cells  to  provide 
high  titer  stocks  for  further  study. 

Characterization  of  Recombinant  Protein  in  hgtll — To  fur¬ 
ther  characterize  these  two  clones,  we  constructed  lysogens  of 
each  in  E.  eoli  Y1089  cells.  The  lysogens  grown  with  IPTG 
and  analyzed  by  immunoblot  with  antiserum  to  PEBl  both 
^^roduced  an  immunoreactive  product  migrating  at  —28  kDa 
(Fig.  1),  essentially  identical  in  apparent  molecular  mass  to 


upstream  from  the  ATG  codon  initiating  ORF  B.  No  putative 
transcriptional  terminator  was  found  in  this  region,  suggest¬ 
ing  that  ORFs  A  and  B  may  be  cotranscribed.  ORF  B  is  795 
nucleotides,  encoding  a  264 -residue  polypeptide,  ending  with 
TAA  at  positions  875-877.  Following  ORF  B  is  a  128-nucleo- 
tide  noncoding  sequence  containing  an  inverted  repeat  that 
could  form  a  stem-loop  structure  (AG  =  -9.0)  (Fig.  3)  (29). 
ORF  C  begins  with  an  unusual  start  codon  (TTG)  at  positions 
1006-1008  (30, 31).  A  putative  ribosomal  binding  site  (AGGA) 
is  located  6  nucleotides  upstream  from  the  TTG  codon.  There 
is  a  sequence  (TAAAAT)  resembling  the  —10  consensus  se¬ 
quence  in  E.  eoli  (TAtAaT)  that  is  35  bases  upstream  from 
the  ribosomal  binding  site,  and  20  nucleotides  farther  up¬ 
stream,  there  is  a  sequence  (TTGAAG)  resembling  the  -35 
consensus  sequence  in  E.  eoli  (TTGACa)  (32).  ORF  C  is  729 
nucleotides,  encoding  a  polypeptide  of  242  amino  acids,  ending 
at  positions  1732-1734  with  TAA.  ORF  D  follows  ORF  C 
after  a  21-nucleotide  noncoding  region.  A  putative  ribosomal 
binding  site  (AGGA)  is  located  6  nucleotides  upstream  from 
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Fio.  2.  Rettrlction  map  of 
pPB119  and  sequencinK  atrategy 
for  peblA  gene.  Restriction  sites  are 
shiiwn  iiImivc  the  2.6-kb  insert  {Ji,  EcoRl", 
H,  HmdIIl;  N,  Nco\).  Three  complete 
OKFs,  B-D,  and  two  partial  ORFs,  A 
and  B,  are  indicated  ImIow  the  insert. 
The  lorift  arrow  represents  the  direction 
of  ironscription  of  peblA.  pPB203  and 
pPH  1 1  are  deletion  mutants  of  pPB  1 19. 
Solid  arrows  represent  sequences  ob¬ 
tained  from  deletion  mutants,  and  dotted 
arrows  from  primer  sequencing. 
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the  start  codon  (ATG)  for  ORF  D  at  position  1756.  ORF  D 
(pe61A)  is  780  nucleotides,  terminated  by  TAA  at  positions 
2533-2535,  and  encodes  a  polypeptide  of  259  amino  acids  with 
a  molecular  mass  of  28.18  kDa.  One  base  downstream  of  ORF 
D,  truncated  ORF  E  begins;  only  the  first  50  amino  acids  of 
this  ORF  can  be  deduced  from  the  insert.  Since  no  potential 
transcriptional  terminators  were  found  among  ORFs  C-E,  it 
is  possible  that  these  ORFs  are  cotranscribed  using  a  common 
promoter  located  upstream  from  ORF  C.  No  ORF  >300 
nucleotides  was  found  in  the  complementary  strand. 

Signal  Sequence  of  PEBl — The  amino-terminal  amino  acid 
sequence  of  mature  native  PEBl  (11)  is  identical  to  the 
deduced  sequence  from  ORF  D  beginning  at  residue  27,  indi¬ 
cating  that  mature  PEBl  has  a  26-re3idue  cleaved  signal 
sequence.  Overall,  the  26-residue  signal  peptide  has  a  calcu¬ 
lated  molecular  weight  of  2742  and  is  similar  in  structure  to 
a  typical  signal  peptide  (33,  34).  Arg  and  Lys  at  positions  4 
and  5,  respectively,  form  its  positively  charged  head;  the  next 
9  residues  form  a  hydrophobic  core,  followed  by  Gly,  an  o- 
helix  breaker,  10  residues  upstream  from  the  cleavage  site.  A 
typical  structure  for  signal  peptidase  I  cleavage  (33, 34)  occurs 
between  Ala”  and  Ala”,  followed  by  negatively  charged  Glu”. 
Immediately  following  the  cleavage  site,  8  of  13  residues  are 
polar.  A  second  conserved  signal  peptidase-processing  struc¬ 
ture  (Leu'*-Gly‘*-Ala”-Cys'‘)  homologous  to  signal  peptidase 
II  cleavage  sites  was  located  in  which  Cys  is  essential.  Leu  is 
highly  conserved,  and  small  amino  acids  between  Leu  and 
Cys  such  as  Gly,  Ala,  Ser,  and  Val  are  preferred  (35). 

Amino  Acid  Composition  and  Codon  Usage — Although  ORF 
D  (peblA)  encodes  a  deduced  protein  of  28.2  kDa,  the  deduced 
molecular  mass  of  mature  PEBl  (27th  to  259th  amino  acid 
residue)  is  25.5  kOa.  The  pi  (8.51)  of  the  deduced  mature 
protein  is  nearly  identical  to  the  experimentally  derived  pi 
(8.5)  of  the  mature  protein  (11).  The  amino  acid  composition 
determined  for  the  mature  protein  from  C.  jejuni  strain  81- 
176  (11)  and  that  deduced  for  peblA  (27th  to  259th  residue) 
are  in  general  agreement,  with  a  few  exceptions.  The  deduced 
mature  protein  contains  39  basic  amino  acid  residues  (33  Lys, 

4  Arg,  and  2  His)  and  35  acidic  residues  (22  Asp  and  13  Glu), 
indicating  a  net  positive  charge  consistent  with  both  the 
determined  and  predicted  pi  values.  Cysteine  is  not  found 
either  in  PEBl  or  in  the  deduced  mature  peblA  gene  product. 
Examination  of  the  deduced  amino  acid  sequence  indicates 
relatively  uniform  distribution  of  positive  and  negative 
charges  over  the  length  of  the  molecule.  Hydrophobic  amino 
acids  with  no  polar  side  chains  represent  50%  of  the  residues. 
The  GC  content  of  peblA  is  31.66%;  A  or  T  represents  88% 
(229/260)  of  the  third-position  nucleotides.  This  GC  content 
and  codon  usage  are  consistent  for  Campylobacter  DNA  (18, 


36-38),  but  as  expected,  are  significantly  different  from  E. 
colt  (39). 

Secondary  Structure  of  peblA  Gene  Product — Secondary 
structure  calculations  (26)  for  the  deduced  mature  protein 
indicate  that  78%  of  the  233  residues  are  in  the  a-helical 
conformation.  The  deduced  signal  peptide  is  entirely  in  the 
a-helical  conformation.  Using  the  method  of  Kyte  and  Doo¬ 
little  (25),  the  only  major  hydrophobic  region  (Leu’-Ala”)  is 
located  in  the  leader  peptide  (data  not  shown);  the  other 
minor  hydrophobic  regions  are  randomly  distributed  over  the 
entire  molecule,  but  none  is  sufficiently  long  for  membrane 
spanning. 

Homologies  of  PEBl  to  Other  Proteins~A  search  of  the 
National  Biomedical  Research  Foundation  (PIR  21.0)  showed 
27.8%  identity  of  the  deduced  pcblA  product  to  E.  coli  gluta¬ 
mine-binding  protein  precursor  (glnH)  (14),  22.9%  identity 
to  Salmonella  typhimurium  LAO  (15),  and  28.9%  identity  to 
S.  typhimurium  histidine-binding  protein  {hisJ)  (16,  40). 
Searches  of  a  variety  of  regions  of  PEBl  show  no  significant 
homologies  to  other  known  proteins.  The  amino  acid  compo¬ 
sition,  molecular  mass,  and  secondary  structure  are  similar 
between  PEBl  and  glnH,  hisJ,  and  LAO;  however,  PEBl  is 
significantly  more  basic  than  these  other  proteins.  A  pairwise 
alignment  of  the  primary  sequence  did  not  show  consecutive 
identical  regions  oC^i-aminq  acid  residues  between  PEBl 
and  glnH  or  LACC^Fig.  4).  The  relationship  of  PEBl  with 
amino  acid-binding  proteins  was  further  confirmed  by  the 
homology  of  ORF  C  to  other  members  of  operons  for  gluta¬ 
mine  and  histidine  transport  systems.  ORF  C^hares  nearly 
50%  identity  with  the  proteins  ginQ  and  /iisI^(Fig.  5),  which 
serve  as  membrane  receptors  for  the  binding'proteins  glnH 
and  hisJ,  respectively.  Both  glnQ  and  hisP,  like  ORF  C,  begin 
with  uncommon  start  codons  such  as  TTG  and  GTG  (14, 15, 
41).  ORF  E,  the  third  member  of  the  putative  PEBl  operon, 
did  not  share  significant  homology  with  other  known  proteins 
in  the  limited  sequence  that  was  identified. 

ORF  B  shares  an  overall  22-24%  identity  with  a  number  of 
heat  shock  proteins  belonging  to  the  hsp70  (42-45)  and  hsp90 
(46)  families,  such  as  the  78-kDa  glucose-regulated  protein  of 
yeast,  which  facilitates  the  assembly  of  multimeric  protein 
complexes  inside  the  endoplasmic  reticulum  and  binds  im¬ 
munoglobulin  heavy  chain  (42, 45).  Homology  was  found  also 
between  mouse  brain  microtubule-associated  pr(^lzs-(47}'an(| 
ORF  A  (45%  identity)  and  ORF  B  (24%  identi^(Fig.  6). 

Conservation  of  peblA  Gene  among  C.  jejuni  Strains — We 
next  sought  to  determine  the  conservation  of  peblA  among 
Campylobacter  strains  by  Southern  hybridixation  since  PEBl 
is  apparently  present  in  all  C.  jejuni  strains  examined,  and  a 
closely  related  molecule  is  found  in  C.  coli  (11).  Initial  analyses 
used  as  the  probe  a  702-bp  PCR  product  from  pPBll9 


A 


C.  jejuni  PEBl,  a  Binding  Component  Homolog 


Kic.  Nucleotide  and  deduced 
umino  acid  sequences  of  2687*bp 
pPOllS  fragment  containing 
peblA.  The  DNA  sequence  was  deter¬ 
mined  fur  hulh  strands  as  described  un¬ 
der  “I^xpcrimental  Procedures.’  The 
three-letter  amino  acid  code  and  the  ter¬ 
mination  codon  TAA  (Och)  are  indicated 
above  each  triplet  nucleotide  codon.  Nu¬ 
cleotides  for  pPBl  19  and  amino  acids  for 
each  open  reading  frame  are  numbered 
on  the  right  of  each  line.  The  ribosomal 
binding  sites  (Shine-Dalgarno  (S.D.)) 
and  the  putative  promoter  are  indicated, 
and  the  boldface  portions  of  the  DNA 
sequence  represent  inverted  repeat  se¬ 
quences  that  may  serve  as  a  transcrip¬ 
tional  terminator.  The  boldface  amino 
acid  sequence  in  ORF  D  was  determined 
by  amino-terminal  sequencing  of  mature 
PEBl  from  C.  jejuni  (11), 


oar  A 

Hta  Lau  Lya  Fro  Mat  Sar  Lau  Lya  clu  Its  Lya  Lya  Clu  Ila  t« 

G  CAT  TTA  AAA  CCT  ATG  ACC  TTA  AAA  CAA  ATT  AAA  AAA  CAA  ATT  4] 

ORF  B 

Val  Asn  Pha  Xla  Aap  Gin  Aap  Och  Hat  Clu  Lya  ) 

CTA  AAT  TTT  ATT  CAT  CAC  CAT  TAA  TAAAAGCAAAATTCC  ATG  CAA  AAA  ft 

S.D. 

Lya  11a  Thr  Fro  Sar  Clu  Lau  Clu  Lau  Aan  Clu  Pha  Ila  Lya  Ila  II 

AAA  ATA  ACT  CCT  ACC  CAA  TTC  CAA  CTT  AAT  CAA  TTT  ATA  AAA  ATT  13* 

Ila  Asn  Clu  Mat  Sar  Cly  Ila  Aap  Lau  Thr  Aap  Lya  Lya-Asn  Ila  3} 

ATC  AAC  CAA  ATC  ACT  CCT  ATT  CAT  TTA  ACC  CAT  AAA  AAA  AAT  ATA  111 

Lau  Ala  Lau  Lya  Lau  Aan  Lya  Pha  Lau  Clu  Cly  Thr  Aan  Thr  Lya  4| 

CTA  CCT  TTA  AAC  TTC  AAT  AAA  TTT  CTT  CAA  CCA  ACT  AAT  ACT  AAA  I3S 

Asn  Pha  Sar  Clu  Pha  Lau  cly  Lya  Lau  Lya  Sar  Aan  Arg  Cln  Lau  63 

AAT  TTT  TCC  CAA  TTT  TTC  CCA  AAA  TTA  AAA  ACC  AAT  ASA  CAA  CTT  171 

Lya  Cln  Clu  Thr  Lau  Asp  Pha  Val  Thr  Ila  Cly  Clu  Thr  Tyr  Pha  71 

AAA  CAA  CAA  ACT  TTA  CAT  TTT  CTA  ACC  ATA  CCT  CAA  ACT  TAT  TTT  316 

Lau  Arg  Clu  Lau  Ala  Cln  Lau  Lya  Clu  Ila  Ila  Tyr  Tyr  Ala  Lya  93 

TTA  ACA  CAA  TTC  CCT  CAA  TTC  AAA  CAA  ATA  ATT  TAT  TAT  CCC  AAA  361 

Sar  Lau  clu  Lya  Arg  Val  Asn  Ila  Lau  Sar  Ala  Pro  Cys  Sar  Sar  101 

ACC  TTA  CAA  AAC  ACA  CTA  AAT  ATC  CTA  ACC  CCC  CCT  TCT  TCA  ACT  406 

Cly  Clu  Clu  Val  Tyr  Sar  Lau  Ala  Lau  Lau  Ala  Ala  Cln  Aan  Pha  133 

CCA  CAA  CAA  CTA  TAT  TCT  TTC  CCA  TTA  TTC  CCT  CCA  CAC  AAT  TTT  451 

Ila  Lya  Aap  Hat  Tyr  Ila  Lau  Cly  Val  Aap  Ila  Aan  Sar  Sar  Val  131 

ATT  AAA  CAT  ATG  TAT  ATT  TTA  CCC  CTT  CAT  ATT  AAT  TCA  ACT  CTC  49  6 

Ila  Clu  Lya  Al^  Lya  Lau  Cly  Lya  Tyr  Cln  Cly  Arg  Thr  Lau  u>r.  153 

ATT  CAA  AAA  CCA  AAA  CTT  CCA  AAA  TAT  CAA  CCA  ACA  ACT  TTA  CAC  54  1 

Arg  Lau  Sar  Clu  Sar  Clu  Lys  Arg  Arg  Pha  Pha  Lau  Clu  Sar  Clu  16S 

CCA  TTC  ACC  CAC  ACT  CAA  AAA  ACA  ACC  TTT  TTT  TTA  CAA  ACC  CAA  SS6 

Aap  Lya  Pha  Tyr  Thr  Ila  Asn  Lys  Asn  Clu  Lau  Cys  Thr  Cys  Lya  183 

CAT  AAA  TTT  TAT  ACT  ATT  AAT  AAA  AAT  CAC  CTT  TCT  ACT  TCT  AAA  631 

Pha  Clu  Leu  Cys  Asn  Val  Pha  Clu  Clu  Lys  Pha  Sar  Arg  Leu  Cly  •  191 

TTT  CAA  CTT  TCC  AAT  CTT  TTT  CAA  CAA  AAA  TTT  TCA  ACA  TTC  CCA  676 

Lya  Pha  Aap  lie  Ila  Ala  Sor  Arg  Asn  Hat  lie  Ila  Tyr  Pha  Asp  213 

AAA  TTT  CAT  ATT  ATA  CCT  TCT  ACA  AAT  ATC  ATT  ATT  TAT  TTT  CAT  721 

Hla  Clu  sar  Lys  Lau  Lya  Lau  Mat'clu  Arg  Pha  Hia  Arg  Ila  Lau  221 

CAT  CAA  TCA  AAA  CTA  AAA  CTT  ATC  CAC  ACC  TTT  CAT  ACA  ATT  TTA  7  66 

Asn  Asp  Lys  cly  Arg  Lau  Tyr  Val  Cly  Asn  Ala  Aap  Lau  Ila  Pro  243 

AAT  CAT  AAA  CCA  ACC  CTT  TAT  CTT  CCC  AAT  CCT  CAT  TTA  ATT  CCA  111 

Clu  Thr  lie  Tyr  Pha  Lys  Lys  lie  Ser  Leu  Cln  Clu  Val  Pha  Thr  251 

CAC  ACT  ATT  TAT  TTT  AAA  AAC  ATT  TCT  CTC  CAA  CAC  CTC  TTT  ACT  156 

Mat  Lys  Lya  Tyr  Lya  Pha  Och  264 

ATC  AAA  AAC  TAT  AAA  TTC  TAA  AAATTAeTAAAACTTACACTTTCCAAATTTA  901 

-35  -10 

TT  AOIAAAAATAACTTACATTTTC  AACT  ACTTTTCTrT  ATTTAATC  ATAJLAATAATrrC  967 


ORF  C 

Met  Ila  Clu  Lau  Lya  5 

AATTAATTTTATATTTACCTAAAAATAAACCAAAAAAC  TTC  ATT  CAA  TTA  AAA  1020 

S.D. 


Asn 

Val 

Asn 

Lya  Tyr  Tyr 

Cly 

Thr 
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His 
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Lau 

Lys 

He 

pha 
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He 
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AAT 

CTT 
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CTT 
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CCA 

1110 
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CAA 

CAA 

CTT 

ACT 

TCA 

CCA 

CAC 

CTC 

CTA 

CTT 

AAC 

AAT 

CTT 

CTT 
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1200 
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a 
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clu 

Clu 

Thr 

Als 
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Lsu 

Lya 

Vsl 

Vsl 

Cly 

Leu 

135 

CAA 

CCT 

CAA 

CAA 

ACA 

CCT 

TTT 

AAC 

TAT 

TTA 
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CTA 

CCT 
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1380 

L«u 
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Als 
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Als 
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Cln  ein  Arq  Val  Ala  I  la  Ala  Arf  Sar  Uu  Cys  Thr  Lya  Lyt  Fro  ISS 

CAA  CAA  CCC  CTT  CCT  ATA  CCA  ACA  TCA  CTT  TCT  ACT  AAA  AAA  CCC  UIO 

Tyr  Ila  Lau  Pha  Aap  Clu  Pro  Thr  Sar  Ala  Lau  Asp  pro  Clujhr  170 

TAT  ATT  TTA  TTT  CAT  CAA  CCT  ACT  TCA  CCC  CTT  CAT  CCA  CAA^CC  ISIS 

Ila  Cln  Clu  Val  Lau  Aap  Val  Mat  Lya  Clu  Ila  Sar  Hia  Cln  Sar  ISS 

ATA  CAA  CAC  CTT  TTA  CAT  CTA  ATC  AAA  CAA  ATT  TCA  CAT  CAA  ACC  1S«0 

Aan  Thr  Thr  Mat  Val  Val  Val  Thr  Hla  Clu  Mat  Cly  Pha  Ala  Lya  200 

AAT  ACT  ACC  ATC  CTC  CTT  CTT  ACA  CAC  CAA  ATC  CCT  TTT  CCA  AAA  ISOS 

Clu  Val  Ala  Asp  Arq  Ila  Ila  Pha  Mat  Clu  Aap  Cly  Ala  Ila  Val  21S 

CAA  CTA  CCA  CAT  ACC  ATT  ATT  TTT  ATC  CAA  CAT  CCT  CCT  ATT  CTC  1«S0 

Clu  Clu  Aan  Ila  Pro  Sar  Clu  Pha  Pha  Sar  Aan  Pro*  Lya  Thr  Clu  210 

CAA  CAA  AAT  ATT  CCT  ACT  CAA  _TTT  TTC  TCA  AAT  CCA  AAA  ACT  CAA  164S 

Ar9  Ala  Arq  Lau  Pha  Lau  Cly  Lya  Ila  Lau  Lya  Aan  Och  242 

ACA  CCC  CCA  CTC  TTT  TTA  CCC  AAA  ATT  CTT  AAA  AAT  TAA  CCAAAAT  174  1 

OEf-P 

Mat  Val  Pha  Arq  Lya  Sar  Lau  Lau  Lya  Lau  Ala  11 

TCAAACeXCAAAAA  ATC  CTT  TTT  ACA  AAA  TCT  TTC  TTA  AAC  TTC  CCA  1711 

S.O. 

Val  Pha  Ala  Lau  Cly  Ala  Cya  Val  Ala  Pha  Sar  Aan  Ala  Aan  Ala  26 

CTT  TTT  CCT  CTA  CCT  CCT  TCT  CTT  CCA  TTT  ACC  AAT  CCT  AAT  CCA  111] 

Ala  Clu  Cly  Lya  Lau  Clu  Sar  Ila  Lya  Sar  Lya  cly  Cln  Lau  Ila  4i 

CCA  CAA  CCT  AAA  CTT  CAC  TCT  ATT  AAA  TCT  AAA  CCA  CAA  TTA  ATA  1171 

Val  Cly  Val  Lya  Aan  Aap  Val  Pro  Hia  Tyr  Ala  Lau  Leu  Aap  Cln  S6 

CTT  CCT  CTT  AAA  AAT  CAT  CTT  CCC  CAT  TAT  CCT  TTA  CTT  CAT  CAA  19  2  J 

Ala  Thr  Cly  Clu  Ila  Lya  Cly  Pha  Clu  Val  Asp  Val  Ala  Lya  Lau  71 

CCA  ACA  CCT  CAA  ATT  AAA  CCT  TTC  CAA  CTA  CAT  CTT  CCC  AAA  TTC  19 6i 

Lau  Ala  Lya  Sar  Ila  Lau  Cly  Asp  Aap  Lya  Lya  Ila  Lya  Lau  Val  16 

CTA  CCT  AAA  ACT  ATA  TTC  CCT  CAT  CAT  AAA  AAA  ATA  AAA  CTA  CTT  20 1 J 

Ala  Val  Asn  Ala  Lya  Thr  Arg  Cly  Pro  Lau  Lau  Asp  /an  Cly  Ser  101 

CCA  CTT  AAT  CCT  AAA  ACA  ACA  CCC  CCT  TTC  CTT  CAT  AAT  CCT  ACT  2055 

Val  Aap  Ala  Val  lie  Ala  Thr  Phe  Thr  lie  Thr  Pro  Clu  Arq  Lya  116 

CTA  cat  CCC  CTC  ATA  CCA  ACT  TTT  ACT  ATT  ACT  CCA  CAC  ACA  AAA  ^  2103 

Arg  Ila  Tyr  Asn  Phe  Ser  Clu  Pro  Tyr  Tyr  Cln  Asp  Ala  lie  Cly  131 

ACA  ATT  TAT  AAT  TTC  TCA  CAC  CCT  TAT  TAT  CAA  CAT  CCT  ATA  CCC  2146 

Leu  Leu  Val  Leu  Lya  Clu  Lys  Lya  Tyr  Lys  Ser  Leu  Ala  Asp  Met  146 

CTT  TTC  CTT  TTA  AAA  CAA  AAA  AAA  TAT  AAA  TCT  TTA  CCT  CAT  ATC  2193 

Lya  Cly  Ala  Asn  Ila  Cly  Val  Ala  Cln  Ala  Ala  Thr  Thr  Lys  Lys  161 

AAA  CCT  CCA  AAT  ATT  CCA  CTC  CCT  CAA  CCT  CCA  ACT  ACA  AAA  AAA  22  36 

Ala  Ila  Cly  Clu  Ala  Ala  Lya  Lys  Ila  Cly  Ila  Asp  Val  Lys  Phe  176 

CCT  ATA  CCT  CAA  CCT  CCT  AAA  AAA  ATT  CCC  ATT  CAT  CTT  AAA  TTT  226  3 

Sar  clu  Pha  Pro  Aap  Tyr  Pro  Sar  Ila  Lya  Ala  Ala  Lau  Aap  Ala  191 

ACT  CAA  TTT  CCT  CAT  TAT  CCA  ACT  ATA  AAA  CCT  CCT  TTA  CAT  CCT  2321 

Lya  Arg  Val  Asp  Ala  Pha  Sar  Val  Asp  Lya  Sar  Ila  Lau  Lau  Cly  206 

AAA  ACA  CTT  CAT  CCC  TTT  TCT  CTA  CAC  AAA  TCA  ATA  TTC  TTA  CCT  2473 

Tyr  Val  Aap  Asp  Lya  Sar  Clu  Ila  Lau  Pro  Aap  Sar  Pha  Clu  Pro  221 

TAT  CTC  CAT  CAT  AAA  ACT  CAA  ATT  TTC  CCA  CAT  ACT  TTT  CAA  CCA  24  !• 

Cln  Sar  Tyr  Cly  Ila  Val  Thr  Lya  Lya  Aap  Aap  Pro  Ala  Pha  Ala  236 

CAA  ACT  TAT  CCT  ATT  CTA  ACC  AAA  AAA  CAT  CAT  CCA  CCT  TTT  CCA  2563 

Lya  Tyr  Val*  Asp  Asp  Pha  Val  Lya  Clu  Hla  Lys  Asn  Clu  Ila  Asp  251 

AAA  TAT  CTT  CAT  CAT  TTT  CTA  AAA  CAA  CAT  AAA  AAT  CAA  ATT  CAT  2506 

ORP  E 

Ala  Lau  Ala  Lys  Lys  Trp  Cly  Lau  Och  Mat  Aan  Clu  Sar  Val  5 

CCT  TTA  CCC  AAA  AAA  TCC  CCT  TTA  TAA  T  ATC  AAT  CAA  ACT  CTA  2551 

Cly  Pha  Val  Clu  Mis  Lau  Arg  Cln  Ila  Lau  Thr  Sar  Trp  Cly  Lau  20 

CCT  TTT  CTT  CAA  CAT  TTA  ACA  CAA  ATT  CTT  ACT  TCT  TCC  CCT  TTA  2596 


Tyr  Aps  Clu  Asn  Sar  Ila  Ser  Pro  Pha  Ala  Val  Trp  Lys  Phe  Lau  35 

TAT  CAT  CAA  KKT  ACT  ATA  ACC  CCT  TTT  CCC  CTA  TCC  AAA  TTT  TTA  2641 


Aap  Ala  Lau  Aap  Aan  Lys  Aap  Ala  Pha  lie  Aan  Cly  Phe  lie  Tyr  50 

CAT  CCT  TTC  CAT  AAT  AAA  CAT  CCT  TTT  ATT  AAT  COT  TTT  ATT  TAT  C  2617 


(primers:  5'-GCAGAAGGTAAACTTGAGTCTATT.3'  (bp 
1834-1857)  and  5^TTATAAACCCCATTTTTTCGCTAA-3' 
(complementary  to  bp  2512-2535),  corresponding  to  the  start 
and  end  of  the  sequence  encoding  mature  PEBl).  Under  high 
stringency  conditions,  this  probe  hybridized  to  a  single  1.8- 
kb  /findlll-digested  chromosomal  fragment  from  all  three  C. 
jejuni  strains,  but  not  to  the  other  Campylobacter  strains 


examir^  (Fig.  7).  When  the  same  pair  of  primers  was  used 
in  PC^analysis,  a  702-bp  PCR  product  was  amplified  from 
all  three  C.  jejuni  strains  tested,  as  predicted,  but  horn  none 
of  the  C.  colif^Jlampylobacler  lari,  or  Campylobacter  fetus 
strains  teste^lFig.  8A).  Restriction  digestion  of  the  peblA 
PCR  products  amplified  from  each  of  the  three  C.  jejuni 
strains  demonstrated  identical  patterns  (Fig.  8B),  exactly  as 


C.  jejuni  PEBl,  a  Binding  Component  Homolog 


9lnHtI4] 


LA0(1S] 


AOKKLV-VATOTXrVP-rEr->CQCOKyV-CrOVDt.WAAI-AKE 

skcqLi-vcvkno-vphyali^atceikcfevov-akuJ^s 
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:  :  GF::D: 


CNEMCKR 
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FlO.  4.  Pairwise  alignment  of 
PEBl  with  two  amino  acid-binding 
proteins.  Gap  penalty  •  4,  deletion  pen¬ 
alty  -  5,  and  similarity  is  defined  by 
PAM  250  matrix,  as  described  (57).  In¬ 
dicated  are  identity  (  |  )  and  conserva¬ 
tive  substitution  for  hydrophobic! ty  and 
charge  (:)  and  size  (.).  For  PEBl  versus 
glnH,  there  is  29.3%  identity  in  a  233- 
amino  acid  overlap,  and  for  PEBl  versus 
LAO.  there  is  23.7%  identity  in  a  232- 
amino  acid  overlap. 


-UCLOYEUtPMOrSCII  — PALQTKNVDLALACXTITDERXKAIOFSOCYYK 
*  !  ::  #  i  {Sa  :{!  •:|.:||{  lii:  **1*  !!: 

ILCODKXlKLVAVNAKTRCPLLDHCSVDAVIATrriTPERKRIYNFSEPVVQ 

-M— QVKCTWVASDFOALIPsijtAKXIDAlisSLSITOKRQQEIAFSDKLYA 
:  .  P  L. ..: D  IT  . R: :  .  FS  Y. 


glnH  -S-CLLVHVXANNNOVK-SVKOLDCKWAV-KSCTC — SV-DYA-KA-HIK- 

.  till.  I  I  I.  J;,|  :::).  ::  :  j  '  .'. 

PEBl  DAICLLVE-K— EKKYK-SLADMKCANICVAQAATTKKAICEAA-KKICIO- 
1  ..  :  J::' 

LAO  ADSRLIAA-K— CSPVQPTLESLKCKHVCVLQCST-QEAYANONWRTKCVOV 
.  L:.  K  ..  :  ::  .  :C  ::V  :::T  :  :  .  :  C:0 
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expected  from  sequence  analysis,  indicating  the  high  degree 
of  conservation  of  the  peblA  gene  among  C.  jejuni  strains. 

DISCUSSION 

PEBl,  a  surface-exposed  conserved  antigen  in  C.  jejuni  and 
C.  coli  that  is  commonly  recognized  by  convalescent  sera  from 
infected  patients  and  is  involved  in  the  binding  of  C.  jejuni  to 
eukaryotic  cells,  is  possibly  a  vaccine  candidate  (11-13).  In 
this  study,  we  found  thatpefelA,  the  gene  cloned  using  anti¬ 
body  to  PEBl,  is  a  homolog  of  the  binding  component  in 
bacterial  amino  acid  transport  systems.  Since  a  role  for  amino 
acid  transport  systems  in  bacterial  pathogenesis  has  not  been 
reported  before,  we  sought  to  determine  whether  the  proper¬ 
ties  of  PEBl  and  those  of  the  recombinant  peblA  product  are 
similar.  Although  we  have  not  yet  performed  studies  to  eval¬ 
uate  the  role  of  the  recombinant  peblA  protein  as  a  cell¬ 
binding  factor,  the  following  evidence  establishes  its  identity 
to  the  native  cell-binding  factor  PEBl.  1)  £.  coli  transformed 
with  pPBll9  (containing  peblA)  expressed  a  protein  similar 
in  electrophoretic  migration,  deduced  isoelectric  point,  and 
amino  acid  composition  to  PEBl  from  C.  jejuni.  The  amino- 
terminal  sequence  determined  by  peptide  analysis  of  mature 
PEBl  matches  that  deduced  from  the  peblA  DNA  sequence. 
A  leader  peptide  was  predictable  (and  observed)  since  PEBl 
does  not  have  an  amino-terminal  methionine  and  is  an  ex¬ 
ported  protein.  The  deduced  molecular  mass  of  the  mature 
peblA  product  is  25.5  kDa,  slightly  less  than  that  determined 
by  SDS-poIyacrylamide  gel  electrophoresis  (28  kDa),  which 
could  be  due  to  the  slower  migration  of  a  basic  protein  that 
has  fewer  net  negative  charges  per  residue.  That  the  DNA 
sequence  predicts  Ala  for  the  first  position  of  the  mature 
protein  whereas  amino-terminal  sequencing  showed  Gly  may 
be  artifactual  since  the  chromatographic  behaviors  of  these  2 
amino  acids  during  peptide  sequencing  are  similar.  2)  We 
have  purified  the  recombinant  peblA  protein  to  homogeneity. 


Antibody  to  PEBl  recognized  the  purified  peblA  proteiiv^3) 
We  have  specifically  mutated  the  peblA  gene  from  wild-type 
C.ye/um  strain  81-176  by  allelic  replacement.  IfpeblA  encodes 
a  protein  other  than  PEBl,  mutating  peblA  should  not  affect 
expression  of  PEBl.  Using  immunoblotting  with  antibody  to 
PEBl,  we  found  the  PEBl  band  in  strain  81-176,  but  not  in 
the  isogenic  peblA*  mutant.’ 

Protein  sequence  comparisons  of  PEB 1  indicated  homology 
to  members  in  the  LAO  superfamily,  including  products  of 
the  glnH,  LAO,  and  hisJ  genes.  These  function  as  amino  acid- 
binding  proteins  as  part  of  periplasmic  amino  acid  transport 
systems  in  Gram-negative  bacteria.  Within  the  superfaniily, 
the  hisJ  and  LAO  products  share  the  highest  homology, 
reflecting  their  binding  of  basic  amino  acids  (histidine  for 
hisJ  and  arginine  for  LAO),  and  share  a  common  cellular 
receptor,  hisP  (15).  Similar  to  glnH,  PEBl  exhibits  an  overall 
identity  of  —25%  to  other  superfamily  members.  However, 
other  evidence  strengthens  the  hypothesis  that  PEBl  belongs 
to  this  superfamily.  Mature  PEBl  and  the  other  three  proteins 
all  are  25-26  kDa,  and  all  contain  a  cleaved  amino-terminal 
signal  peptide,  a  high  percentage  of  lysine,  and  conserved  Lys- 
Lys  sequences  near  the  carboxyl  terminus.  The  presence  of  a 
signal  peptide  and  the  absence  of  transmembrane  domains  in 
the  mature  protein  are  consistent  with  each  being  secreted 
beyond  the  cytoplasmic  membrane.  In  the  superfamily,  PEBl 

most  closely  related  to  glnH  in  both  sequence  similarity 
and  hydrophobicity  distribution,  especially  between  residues 
58  and  163. 

In  several  periplasmic  binding  protein-dependent  transport 
systems  (14,  41,  51-53),  the  binding  protein  and  the  mem¬ 
brane-associated  components  are  each  encoded  in  the  same 
operon  containing  three  or  four  structural  genes.  Genomic 
organization  of  ORFs  A-E  in  pPBlI9  indicates  that  the  2.6- 
kb  insert  contains  two  partial  operons  separated  by  a  noncod¬ 
ing  region  between  ORFs  B  and  C.  The  putative  transcrip- 

’  Z.  Pei  and  M.  J.  Blaser,  unpublished  data. 


Fio.  5.  Homology  between  ORF  C 
and  glrxQ  and  hisP.  On  the  consensus 
line,  upper-case  Utters  represent  residues 
conserved  in  all  three  molecules  at  that 
position,  and  lower-case  letters  represent 
residues  conserved  in  two  of  the  three 
molecules. 


Fic.  6.  Homology  between  ORFs 
A  and  B  and  mouse  brain  microtu¬ 
bule-associated  protein  (MAPIB) 
(47).  Parameters  used  were  gap  penalty 
“  1,  gap  size  penalty  0.05,  and  joining 
penalty  «  20. 
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Fig  7.  Southern  hybridization  showing  conservation  of 
peblA  gene  in  C.  jejuni  chromosomal  DNA  digested  with 
ffindill.  A  702-bp  PCR  product  corresponding  to  the  DNA  sequence 
of  mature  PEBl  was  used  as  probe.  Lane  a-c,  C.  jejuni  strains  81- 
176,  85-H.  and  81-95,  respectively;  lanes  d  and  e,  C,  coli  strains  D126 
and  D730,  respectively;  lanes  /  and  g,  C.  fetus  strains  23D  and  84-91, 
respectively.  Molecular  size  markers  (in  kilobases)  are  shown  to  the 
left. 
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Fig.  S.  A.  PCR  amplification  of  702-bp  peblA  fragment  from 
Campylobacter  strains.  Lane  a-c,  C.  jejuni  strains  81-176,  D1916,  and 
85AC,  respectively;  lanes  d  and  e,  C.  coli  strains  D126  and  D1035, 
respectively;  lanes  f  and  g,  C.  lari  strains  DllO  and  D67,  respectively; 
lane  h,  C.  fetus  strain  23D;  lane  i,  E.  coli  with  pPB119.  A  102-bp  PCR- 
amplified  product  was  found  in  all  C.  jejuni  strains  {arrow),  but  not 
in  the  other  Campylobacter  species.  B,  restriction  pattern  of  702-bp 
PCR  products  from  C.  jejuni  strains.  The  702-bp  PCR  products  were 
undigested  or  were  digested  with  SspI  or  Hoelll.  Strain  81-176  is 
shown  in  lanes  a  and  d  (undigested)  and  c  (Sspl-digested),  strain 
D1916  in  lanes  b  (undigested)  and  a  and  d  (Sspl-digested),  and  strain 
85AC  in  lanes  c  (undigested),  b  (SspI -digested),  and  a  (/faelll-di- 
gested).  SspI  cleaved  the  702-bp  PCR  products  from  each  strain  into 
370-,  173-,  and  159-bp  fragments,  and  Haelll  cleaved  the  PCR  prod¬ 
ucts  from  each  strain  into  499-  and  203-bp  fragments,  indicating  that 
thepebiA  gene  is  highly  consc.-.  cd  in  C.  jejuni. 


tional  terminator  in  this  region  indicates  a  potential  3' -end 
of  the  operon  containing  ORFs  A  and  B.  The  putative  pro¬ 
moter  just  downstream  indicates  the  5' -end  of  the  operon 
containing  ORFs  C,  D  (peblA),  and  E.  Support  for  this 
hypothesis  includes  the  following.  1)  The  putative  promoter 
is  the  only  promoter  found  upstream  from  ORFs  C-E,  which 
could  be  responsible  for  focZ-independent  transcription  of 
ORF  D  in  pPB219;  2)  there  is  no  transcriptional  terminator 
identified  between  ORFs  C  and  D  and  between  ORFs  D  and 
E;  and  3)  in  E.  coli  and  S.  typhimurium,  the  genes  homologous 
to  ORFs  C  and  D  are  randomly  aligned  and  cotranscribed 
(14-16).  A  putative  function  of  ORF  E  is  unknown  at  present. 

Although  the  homology  between  PEBl  and  amino  acid¬ 
binding  proteins  is  significant,  PEBl  is  unique  since  all  iden¬ 
tified  amino  acid-binding  proteins  in  bacterial  transport  sys¬ 
tems  are  located  in  the  periplasmic  space,  whereas  PEBl  is 
exposed  on  the  bacterial  surface  (12).  Members  of  the  LAO 
superfamily  all  contain  a  leader  peptide  processed  by  signal 
peptidase  I,  which  enables  these  molecules  to  cross  the  cyto¬ 
plasmic  membrane.  PEBl  also  has  such  a  processing  site:  as 
demonstrated  by  both  the  characteristics  of  the  deduced  PEBl 
sequence  and  the  evidence  for  cleavage  at  this  site  in  C.  jejuni 
(11).  In  addition,  PEBl  contains  a  putative  signal  peptidase 
II  cleavage  site.  Signal  peptidase  II  processes  lipoprotein 
precursors  acylated  on  the  free  sul.fiiydryl  group  of  cysteine 
by  cleavage  of  the  peptide  bond  at  the  amino-terminal  side  of 
cysteine.  The  acyl  chains  of  the  lipoprotein  often  anchor  the 
polypeptide  to  membranes  (54).  Nearly  all  secreted  proteins 
in  bacteria  have  only  one  signal  peptidase  cleavage  site  proc¬ 
essed  by  either  signal  peptidase  I  or  II.  However,  the  endog- 
lucanase  precursor  of  Pseudomonas  solanacecrum  has  a  signal 
sequence  of  45  residues  with  two  processing  sites  {55).*The 
endoglucanase  is  modified  by  fatty  acylation  at  Cys”,  cleaved 
by  signal  peptidase  II,  and  exported  across  the  inner  mem¬ 
brane.  The  lipoprotein  intermediate  is  then  cleaved  at  the 
signal  peptidase  Mike  site  (Ala**-Ala“)  during  export  across 
the  outer  membrane.  Since  PEBl  also  contains  two  signal 
peptidase-processing  sites,  the  location  of  PEBl  on  the  bac¬ 
terial  surface  suggests  that  C.  jejuni  could  utilize  similar 
mechanisms  to  export  this  protein  across  both  the  cytoplasmic 
and  outer  membranes.  If  this  hypothesis  is  true,  we  would 
expect  that  mature  PEBl  is  not  a  lipoprotein  since  the  final 
cleavage  at  Ala*'  eliminates  any  amino  acid  residues  located 
between  residues  1  and  26,  including  the  putatively  acylated 
Cys”.  This  hypothesis  was  supported  by  the  evidence  that 
mature  PEBl  begins  at  Ala”  and  that  amino-terminal  se¬ 
quencing  was  not  blocked  by  lipid  (11). 

VVe  speculate  that  the  major  cell-binding  factor  PEBl  may 
have  a  common  evolutionary  origin  with  periplasmic  amino 
acid-binding  proteins,  from  which  PEBl  gains  the  binding 
capacity.  Since  the  genome  of  C.  jejuni  is  only  half  the  size  of 
that  of  E.  coli  or  Salmonella  (56),  C.  jejuni  may  use  particular 
proteins  for  multiple  purposes.  Two-step  cleavage  of  the  PEBl 
leader  peptide  may  distinguish  it  from  these  amino  acid¬ 
binding  proteins  and  make  it  accessible  to  the  bacterial  sur¬ 
face  to  perform  cell  binding  functions  as  well. 

The  codon  usage  for  PEBl  shows  strong  third-position  AT 
preference;  consequently,  Arg,  Asn,  Cys,  Gin,  His,  and  Tyr 
are  single-codon  amino  acids,  which  has  been  previously  ob¬ 
served  for  C.  jejuni  serine  hydroxymethyltransferase  (glyA) 
(48).  Knowledge  of  this  phenomenon  will  be  helpful  in  design¬ 
ing  oligonucleotide  probes  to  clone  other  C.  jejuni  proteins  for 
which  antibody  probes  are  not  available. 

ORF  C  begins  with  TTG,  which  although  not  common,  has 
been  previously  identified  (30,  31,  41,  49,  50).  The  ORF  C  and 
gInQ  and  htsP  gene  products  are  highly  homologous,  and  their 
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ORFs  also  begin  with  uncommon  initiation  codons  (14,  15). 
Exparimental  replacement  of  TTG  with  ATG  led  to  a  2.5- 
3.7-fold  increase  in  protein  translation  (49,  50).  The  low 
translation  efficiency  when  TTG  is  the  initiating  codon  sug- 
a  possible  conser  ::^  mechanism  for  control  of  expres¬ 
sion  of  this  family  of  romologous  proteins. 

Although  both  C.  jejuni  and  C.  coli  strains  contain  PEBl 
homologs  (Hi,  .<sing  primers  corresponding  to  the  amino  and 
carboxyl  termini  of  mature  PEBl.  we  PCR-ampIified  the 
exp'cled  fragment  from  C.  jejuni  strains,  but  not  from  C.  ccli 
..trains.  Amino-terminal  differences  in  the  C.  jejuni  and  C. 
coli  PEBl  homologs  (11)  correlate  with  the  failure  of  both 
amplification  and  hybridization  of  thepeblA  gene  with  C.  coli 
strains.  In  contrast  to  C.  coli  strains,  the  primary  sequence  of 
peblA  must  be  highly  conserved  among  C.  jejuni  strains,  as 
shown  by  the  conserved  PCR  product  restriction  profiles, 
again  indicating  its  importance  to  the  organism. 

Acknowledgment — We  thank  Charlene  Oakley  for  typing  this  man¬ 
uscript. 
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